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Abstract 

Background: The Q151M multi-drug resistance (MDR) pathway in HIV-1 reverse transcriptase (RT) confers reduced 
susceptibility to all nucleoside reverse transcriptase inhibitors (NRTIs) excluding tenofovir (TDF). This pathway emerges 
after long term failure of therapy, and is increasingly observed in the resource poor world, where a nti retroviral therapy 
is rarely accompanied by intensive virological monitoring. In this study we examined the genotypic, phenotypic and 
fitness correlates associated with the development of Q151M MDR in the absence of viral load monitoring. 

Results: Single-genome sequencing (SGS) of full-length RT was carried out on sequential samples from an HIV- 
infected individual enrolled in ART rollout. The emergence of Q151M MDR occurred in the order A62V, V75I, and 
finally Q151M on the same genome at 4, 17 and 37 months after initiation of therapy, respectively. This was 
accompanied by a parallel cumulative acquisition of mutations at 20 other codon positions; seven of which were 
located in the connection subdomain. We established that fourteen of these mutations are also observed in 
Q151 M-containing sequences submitted to the Stanford University HIV database. Phenotypic drug susceptibility 
testing demonstrated that the Q1 51 M-containing RT had reduced susceptibility to all NRTIs except for TDF. RT 
domain-swapping of patient and wild-type RTs showed that patient-derived connection subdomains were not 
associated with reduced NRTI susceptibility. However, the virus expressing patient-derived Q151M RT at 37 months 
demonstrated -44% replicative capacity of that at 4 months. This was further reduced to -22% when the Q151M- 
containing DNA pol domain was expressed with wild-type C-terminal domain, but was then fully compensated by 
coexpression of the coevolved connection subdomain. 

Conclusions: We demonstrate a complex interplay between drug susceptibility and replicative fitness in the 
acquisition Q151M MDR with serious implications for second-line regimen options. The acquisition of the Q151M 
pathway occurred sequentially over a long period of failing NRTI therapy, and was associated with mutations in 
multiple RT domains. 



Background 

RT inhibitors (RTIs) are the mainstay of combination 
antiretroviral therapy (cART). Recommended first-line 
therapy regimens for HIV-1 treatment usually comprise 
two nucleos(t)ide RTIs (NRTIs) plus a third agent, 
either a non-nucleoside RTI (NNRTI) or a boosted 
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protease inhibitor (bPI) or integrase inhibitor [1-3]. 
More than 90 mutations have been identified in HIV-1 
RT to be associated with resistance to RTIs, and the 
majority are clustered either around the polymerase 
active site or the hydrophobic binding pocket of 
NNRTIs in the DNA pol domain of RT [4-7]. A conse- 
quence of some of these mutations is a severe loss of 
viral replicative capacity which can subsequently be 
restored by compensatory mutations elsewhere within 
RT [8]. 
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The Q151M MDR is important because it has been 
shown to confer resistance to almost all NRTIs with the 
exception of TDF [9]. The Q151M MDR complex is 
composed of the Q151M mutation, which is normally 
the first to appear, followed by at least two of the fol- 
lowing four mutations: A62V, V75I, F77L and F116Y 
[10]. The Q151M MDR complex was initially described 
to develop during long-term NRTI-containing combina- 
tion therapy or NRTI therapy with zidovudine (AZT) 
and/or didanosine (ddl) [11,12]; however, it is now 
rarely observed in resource-rich countries, where more 
potent cART is used. It is believed that the Q151M 
MDR complex occurs infrequently because the Q151 to 
M mutation requires a 2-bp change (CAG to ATG), and 
the two possible intermediate changes of Q151L (CAG 
to CTG) and Q151K (CAG to AAG) significantly reduce 
viral replication capacity in vitro and are seldom 
observed in vivo [13-15]. The replicative capacity of a 
Q151L-containing virus was shown to improve in the 
presence of S68G and M230I mutations suggesting that 
compensatory mutations could favour the emergence of 
the Q151M MDR complex [13,15]. 

The Q151M complex has been identified in up to 
19% of patients failing therapy containing stavudine 
(d4T) as part of ART rollout in the developing world, 
particularly where treatment is given without virologi- 
cal monitoring, thus allowing long term viraemia 
whilst on first-line therapy [16-18]. This includes the 
CHAP2 (Children with HIV Antibiotic Prophylaxis) 
prospective cohort study of Zambian children on a 
first-line therapy of lamivudine (3TC)/d4T/nevirapine 
(NVP) where 2 out of 26 children (8%) for whom resis- 
tance data were obtained had developed resistance via 
this pathway [19]. 

Although mutations causing resistance to RTIs have 
been shown to occur mainly in the DNA pol domain of 
RT, recent studies have implicated mutations in the C- 
terminal region of RT in resistance and possibly in 
restoring replication fitness of the HIV-1 drug-resistant 
variants [20,21]. Some of these mutations, such as 
N348I in the connection subdomain, have been reported 
to have a prevalence of 10-20% in treatment-experi- 
enced individuals [22]. The N348I mutation is associated 
with M184V and TAMs, and increases resistance to 
NRTIs such as AZT, as well as the NNRTI NVP. N348I 
confers resistance by reducing RNase H activity which 
allows more time for the excision or dissociation of the 
RT inhibitors [22-27]. However, few data are available 
on the evolution and genetic linkage of C-terminal 
mutations in the context of Q151M MDR complex, 
especially in non-B subtypes. In this study, we per- 
formed a detailed analysis of sequential samples col- 
lected from a patient in the CHAP2 cohort study who 



had developed resistance via the Q151M pathway to dis- 
sect the intrapatient viral population dynamics in the 
context of full-length RT. 

Results 

We investigated the emergence of the Q151M MDR 
complex in one of the two patients in the CHAP2 
cohort study who had developed resistance via the 
Q151M pathway [19]. The patient, designated P66, was 
infected with HIV-1 subtype C virus. 

Dynamics of emergence and genetic linkage of Q151M 
MDR complex mutations 

Patients enrolled in the CHAP2 cohort study had CD4 
counts done approximately every 6 months and plasma 
was stored for retrospective viral load and genotypic 
testing. For patient P66, six samples were collected at 0, 
4, 10, 17, 28, and 37 months after initiation of therapy; 
four of which were available for viral load testing and 
SGS analysis. The viral load and CD4% counts for 
patient P66 are shown in figure 1. We initially deter- 
mined the development of Q151M MDR complex using 
SGS of full-length RT gene in the four sequential sam- 
ples collected from patient P66 at 4, 17, 28 and 37 
months. More than 30 single-genome sequences were 
generated per time point except for the 4- and 28- 
month time points when we obtained 6 and 0 sequences 
respectively. Genetic linkage analysis of the single gen- 
omes at 4, 17 and 37 months showed that the patient 
acquired the Q151M MDR mutations in the order: 
A62V, V75I and finally Q151M (Table 1). The emer- 
gence of Q151M after the secondary mutations A62V 
and V75I is rare. In addition, the analysis showed that 
drug resistance mutation T69N was genetically linked to 
Q151M MDR mutations and was acquired prior to 
Q151M. 

Accessory mutations in the DNA pol domain of RT 
have previously been demonstrated in the route to 
acquisition of Q151M MDR complex in subtype B 
viruses [12,28]. We, therefore, determined whether 
accessory mutations developed in this subtype C HIV-1 
virus and whether the C-terminal region of RT played a 
role in the emergence of the Q151M MDR complex. 
The emergence and presence of mutations in DNA pol 
domain, connection subdomain and RNase H domain 
were assessed by SGS, and their genetic linkage to 
Q151M MDR mutations was determined. A pre-treat- 
ment sample was not available for analysis from patient 
P66; therefore a codon change was scored as a mutation 
if it met one of the following criteria: (i) if it was a 
known drug resistance mutation as determined by Inter- 
national AIDS Society-USA (IAS-USA) [29], (ii) if it was 
not present in sequences from a previous time point or 
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Figure 1 Clinical profile of patient P66 Longitudinal viral load, 
CD4% and ART regimen data for patient P66 during a 3-year follow 
up period starting from initiation of cART. 



underwent a significant change in frequency between 
time points. This analysis showed a cumulative increase 
in mutations in all RT domains (Table 1). Mutations 
were identified at 12 codon positions in DNA pol 
domain, namely, 31, 33, 48, 68, 102, 123, 135, 174, 197, 
202, 203 and 314; seven in connection subdomain, 357, 
371, 386, 399, 403, 458 and 471; and one in RNase H 
domain, 517. The correlation between the progressive 
increments in the frequency of these mutations and the 
sequential acquisition of the Q151M MDR mutations 
suggested that they could be facilitating the emergence 
of the Q151M MDR complex. This notion is further 
supported by the observation that 18 out of the 20 
mutations were present in a majority of the single gen- 
omes by 37 months and nearly half of them were pre- 
sent in all the single genomes (Table 1). 

The Q151M MDR mutations were also genetically 
linked to NRTI mutations M184IV and L210F, and 
NNRTI mutations E138A, Y181I and H221Y (Table 1). 
Of note, the N348I mutation was identified in the con- 
nection subdomain of all single genomes at 4 months. 
However, the mutation was present in only one out of 
33 single genomes at 17 months but none of the 31 sin- 
gle genomes at 37 months when the Q151M mutation 
emerged (Table 1). 

Intrapatient viral genetic diversity in the route to 
acquisition of Q151M MDR complex 

The evolution and viral population dynamics within 
patient P66 were examined further by phylogenetic 



analyses. Maximum likelihood (ML) trees of the PR-RT 
single-genome sequences generated from the sequential 
samples of the patient are shown in Figure 2A. In gen- 
eral, the ML-inferred genealogy clustered all single gen- 
omes from each time point within a monophyletic clade 
with corresponding progressive increases in genetic dis- 
tances. Intriguingly, the analyses also showed a serial 
replacement effect with sequences from successive time 
points arising from a single branch of a cluster of 
sequences from a preceding time point. This suggests a 
serial founder effect in the development of Q151M 
MDR. Furthermore, ML-inferred genealogy of the 
sequences with drug resistance codons removed showed 
that the serial founder effect and monophyletic cluster- 
ing of the sequences from each time point was main- 
tained (Figure 2B). This indicates that the identified 
accessory mutations could be playing an important role 
in the evolution and development of the Q151M MDR. 

High prevalence of some of the identified accessory 
mutations in subtype B and C infected patients 

Next, we determined if the 20 accessory mutations that 
we identified in patient P66 were present in other 
patients who had developed resistance via the Q151M 
pathway. We compared mutation frequencies in subtype 
B or C samples from RTI-treatment naive patients and 
Q151M-containing patient samples on the Stanford Uni- 
versity HIV drug resistance database. A significant num- 
ber of sequences (15 to 12,361) were available for 
analysis in each subgroup, except for connection subdo- 
main and RNase H domain of Q151M-containing sub- 
type C sequences, in which there was only one sample 
sequenced beyond the DNA pol domain. Therefore, the 
analysis for subtype C sequences could only be carried 
out for the DNA pol domain. This showed that eight out 
of the 12 codon positions identified in the DNA pol 
domain of patient P66 were significantly associated with 
the sequences containing the Q151M mutation com- 
pared to RTI-treatment naive sequences. These codon 
positions were 31, 33, 48, 68, 123, 174, 202 and 203 (P < 
0.042; Table 2). In contrast, two of these codon positions, 
namely 48 and 174, were not associated with the acquisi- 
tion of Q151M in subtype B infected patients, but an 
additional two others were, namely 102 and 197 (P < 
0.029). Interestingly, codon positions 386 and 403 in con- 
nection subdomain were also significantly associated with 
the acquisition of Q151M in subtype B infected indivi- 
duals (P < 0.018). These data indicate that some of the 
accessory mutations identified in the DNA pol domain 
and connection subdomain of patient P66 are highly pre- 
valent in patients who develop resistance through the 
Q151M pathway and that they could be playing an 
important role in the acquisition of the Q151M MDR. 
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Table 1 The sequential acquisition of Q151M MDR mutations and the frequency of other RT mutations linked to MDR 
mutations, in patient P66. 



Type or Location of mutations 
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Genetic 1 
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V 84 
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a Wild-type residue was determined based on 4-month sequences and frequency in treatment-naive individuals as determined using Stanford University HIV 
database 

b Viral load in copies/mL 

c Number of single genomes linked or unlinked to Q151M MDR mutations 

d Percent of single genomes with that particular mutation calculated as follows: number of mutations per codon/number of single genomes linked or unlinked to 
Q151M MDR (n) x 100% 



C-terminal mutations are not associated with decreased 
susceptibility of Q1 51 M-containing viruses to NRTIs in 
patient P66 

Consequently, we investigated whether the C-terminal 
mutations we observed affected susceptibility to NRTIs. 
Unique restriction sites were introduced in RT and IN 
genes without changing the amino acid coding, in both 
the packaging vector and cloned patient fragments in 
order to facilitate RT domain-swapping (Figure 3A). 
The patient-derived RTs remained d4T-susceptible until 



the development of the Q151M mutation at 37 months, 
when there was a significant increase (~16-fold) in IC 50 
values compared to wild-type RT (Figure 3B; P < 0.002). 
At most we observed a 1.3-fold change in susceptibility 
to d4T at 4 or 17 months leading us to conclude that 
Q151M is the main contributor to d4T resistance in the 
Q151M MDR complex. The patient-derived RT exhib- 
ited a 23-fold increase in 3TC IC 50 values at 4 months 
which did not increase at 17 and 37 months despite the 
acquisition of the Q151M MDR mutations (Table 3). 
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Figure 2 ML phylogenetic analysis of single genome sequences. Branch lengths were estimated using the GTR model of substitution and 
are drawn in scale with the bar at the bottom representing 0.008 nucleotide substitutions per site. The colour of each tip branch represents the 
time after initiation of therapy when the sample from which the single-genome originates was collected as shown in the legend in each figure. 
(A) Phylogenetic tree of 70 single genomes generated from 3 sequential samples from patient P66 infected with subtype C HIV-1 virus. (B) Same 
as (A) but with the following 12 RT drug resistance codons removed from the aligned single-genome sequences to determine the effect of drug 
resistance mutations on viral evolution: 62, 69, 75, 90, 138, 151, 181, 184, 210, 221, 230 and 348. The trees were rooted using the subtype C 
reference sequence MJ4. 



The effect on susceptibility to 3TC was probably due to 
M184I/V mutations which were seen by 4 months. The 
23-fold reduction in susceptibility is relatively lower 
than observed in other studies [30,31]- This could be 
because our assay uses full-length RT fragments derived 
from clinical isolates. It has recently been shown that 
the use of a co-evolved or subtype-specific C-terminal 
region of RT can influence the magnitude of drug resis- 
tance observed in a phenotypic drug susceptibility assay 
[32]. 

Analysis of susceptibilities of patient-derived RTs to 
the CHAP2 second-line NRTIs ddl and ABC showed a 
cumulative decrease in susceptibility in the order; 1.2- 
and 1.7-fold at 4 months, 4- and 6-fold at 17 months, 
and finally 9.9- and 10.8-fold at 37 months, respectively 
(Figure 3C). Thus, unlike d4T the cumulative acquisition 
of mutations on the route to Q151M MDR complex 
results in a parallel cumulative decrease in susceptibil- 
ities to ABC and ddl. In addition, the recombinant 
viruses expressing patient-derived RTs exhibited 
decreased susceptibilities to NRTIs FTC of >79-fold at 4 
months and AZT of >15-fold at 37 months (Table 3) 
but remained susceptible to TDF even after the acquisi- 
tion of the Q151M mutation at 37 months (Figure 3D) 
with no significant increases in IC 50 values (P > 0.18). 
The susceptibility to TDF could probably be influenced 
by the presence of M184V which has been shown to 
increase HIV-1 sensitivity to TDF [33,34]. 

The expression of the patient-derived DNA pol 
domain at 37 months plus wild-type C-terminal region 



or coevolved connection subdomain showed no signifi- 
cant differences in IC 50 values to d4T (P > 0.05) sug- 
gesting that none of the identified C-terminal mutations 
in patient P66 at 37 months contributed to the reduc- 
tion in susceptibility to d4T (Figure 3B). Similarly, the 
coevolved C-terminal region did not contribute to 3TC 
resistance, including the previously identified N348I 
mutation at 4 months, neither did they contribute to the 
decreases in susceptibility to ABC, ddl or FTC (Figure 
3C and 3D and Table 3). However, we observed an 
effect of the C-terminal mutations at 37 months to 
AZT, with the co-evolved C-terminal region contribut- 
ing a 2.5-fold increase in AZT resistance (Table 3). 

Finally, we determined the effect of the mutations on 
susceptibility to NVP, the NNRTI used for first-line 
therapy in the CHAP2 cohort study. The recombinant 
viruses expressing the patient-derived C-terminal region 
at 4 months, but not at 17 or 37 months, exhibited a 5- 
fold increase in the NVP IC 50 value relative to wild-type 
(P < 0.002; Table 4). The decrease in NVP susceptibility 
associated with the C-terminal domain at 4 months is 
likely due to the presence of the N348I mutation in the 
connection subdomain which disappears at later time 
points. 

Connection subdomain mutations in patient P66 partially 
restore replicative fitness of Q151M MDR-containing 
viruses 

Since we did not observe any association of C-terminal 
mutations at 37 months with a decrease in susceptibilities 
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Table 2 Analysis of the frequency of accessory mutations in RTI-treatment naive and Q151M-containing sequences on 
Stanford University HIV database. 
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a The residue occurring in the majority of RTI-treatment naive patient sequences is referred to as wild-type. Codon positions showing statistically significant 
difference in mutation frequency between RTI-treatment naive and Q151M-containing sequences are indicated in bold. Subtype C: 131 (P = 0.033), A33 (P = 
0.024), T48 (P < 0.0001), S68 (P < 0.0001), D123 (P = 0.042), Q174 (P = 0.003), 1202 (P < 0.0001) and E203 (P = 0.011). Subtype B: 131 (P < 0.0001), A33 (P = 0.024), 
S68 (P < 0.0001), K102 (P = 0.006), 1135 (P = 0.029), Q197 (P = 0.015), 1202 (P < 0.0001), E203 (P < 0.0001), T386 (P < 0.0001) and T403 (P = 0.018). 
Sequences containing the Q151M mutation 

The number of sequences used for the analysis. Only one sequence was used per individual if multiple sequences were available. 

d The percentage of sequences with an amino acid change from wild-type residue. The mutant amino acid(s) present at a frequency greater than 1% is shown in 
brackets. 

e The difference in mutation frequency between Q151M-containing and RTI-treatment naive sequences; plus sign indicates an increase and minus sign a decrease 
in mutation frequency in Q151M-containing sequences compared to RTI-treatment naive. 
f NC = Not calculated (one sequence available for analysis). 
9 NA = Not applicable (no sequences available for analysis). 



to first-line drugs, we evaluated their effect on virus repli- 
cative capacity by infecting HEK293T cells with equiva- 
lent amounts of virus. The patient's sample before 
initiation of therapy was not available, thus the replicative 
capacity of the viruses measured by relative luciferase 
light units was compared to that of the virus expressing 
full-length patient-derived RT at 4 months. The patient- 
derived RT at 4 months had already developed the 
M184I mutation which is known to affect viral replicative 
fitness [35,36]. The virus expressing the full-length 
patient-derived RT containing the Q151M mutation at 



37 months demonstrated -42% replicative capacity of 
full-length patient-derived RT at 4 months (P < 0.0001; 
Figure 2E). This was further significantly decreased to 
-22% (P < 0.0001) when the patient-derived DNA pol 
domain at 37 months was expressed in combination with 
wild-type connection subdomain and RNase H domain. 
This decrease in replicative capacity was fully compen- 
sated (to -55% replicative capacity) by the coexpression 
of the coevolved connection subdomain at 37 months. In 
contrast, replicative capacity of the full-length patient- 
derived RT at 17 months was comparable to that at 4 
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Figure 3 NRTI susceptibilities and replicative capacity associated with RT domains of patient P66. (A) Schematic representation of full- 
length and chimeras of subtype C wild-type and patient-derived RT gag-pol expressing vectors used for drug susceptibility and replicative 
capacity testing. The positions of the restriction sites used for cloning of patient-derived PR-RT fragments (Apal and Clal) and for RT domain 
swapping (Hpal and Spel) are indicated above the vector. The origins of the RT domains are shown as different coloured boxes: black, wild-type 
virus; dark gray, patient-derived RT at 4 months; light gray, patient-derived RT at 17 months; and white, patient-derived RT at 37 months. The 
names of the vectors are indicated on the right with a number representing the month when the sample was collected followed by the 
patient-derived domain(s) being expressed. Mutations present in each domain are shown on the full-length RT constructs as follows: inside the 
box, NRTI-associated resistance mutations; above the box, NNRTI-associated resistance mutations; and below the box, other mutations. Pol, DNA 
pol domain; Cn, Connection subdomain; Rh, RNase H domain. (B) Susceptibility to d4T exhibited by patient-derived full-length RTs and RT 
domains. (C) Susceptibility to second-line NRTI ABC exhibited by patient-derived full-length RTs. (D) Susceptibility to second-line NRTI ddl 
exhibited by patient-derived full-length RTs. (E) Susceptibility to TDF exhibited by patient-derived full-length RTs. (F) Replicative capacities relative 
to virus expressing full-length patient-derived RT from 4-months after initiation of therapy, set at 100%, are shown for each virus. The error bars 
represent standard error of the mean of three or more independent experiments. 



months. This suggests that the Q151M mutation, as well 
as being the main determinant of drug resistance in the 
Q151M MDR complex, also has a more significant effect 
on virus replication fitness that is partially restored by 
mutations in the connection subdomain. 



Discussion 

Multiple mutations throughout HIV-1 RT are associated 
with RTI resistance including recently identified muta- 
tions in the connection subdomain and RNase H 
domain [10,21,27]. However, there are few data on 
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Table 3 3TC, AZT and FTC susceptibilities associated with 
RT domains of patient P66. 



Virus 


3TC 




AZT 




FTC 




ic 50 a 


FC b 


ic 50 a 


FC b 


IC 50 a 


FC b 


Wild- 


8.5 ± 0.8 




168.6 ± 46.8 




2.2 ± 0.3 




type 














4-RT 


1 98.8 ± 1 8.6 


23.3 


76.9 ± 6.8 


0.5 


184.2 ± 14.2 


84.9 


4-Pol 


211.5 ± 17.5 


24.8 


60.0 ± 1 3.8 


0.4 


168.1 ± 6.4 


77.5 


17-RT 


224.1 ± 16.9 


26.3 


56.4 ± 7.2 


0.3 


228.8 ± 6.7 


105.5 


1 7-Pol 


206.5 ± 9.7 


24.2 


58.1 ± 14.2 


0.3 


218.9 ± 13.3 


100.9 


37-RT 


219.7 ± 7.5 


25.8 


5120.9 ± 515.6 


30.4 


230.9 ± 1 0.2 


106.4 


37-Pol 


217.8 ± 18.1 


25.6 


2025.3 ± 144.2 


12.0 


231.5 ± 17.1 


106.7 



a 50% inhibitory concentration in nM ± SEM. 
b Fold change in IC 50 compared to wild-type virus. 



sequential acquisition and genetic linkage of these muta- 
tions and their impact on drug susceptibility and repli- 
cative capacity, especially in non-B subtype HIV-1 
viruses which account for nearly 90% of the epidemic 
worldwide [37]. In this study, we took advantage of 
treatment failure in the absence of viral load-guided 
therapy to dissect the relative contribution of RT 
domains in the route to high-level NRTI drug resistance 
through the Q151M pathway. 

As expected we found that the development of muta- 
tions was broad throughout RT. The virus from the 
patient we investigated had developed more than 12 
known drug resistance mutations and 20 additional 
mutations in RT, nearly half of which were located in 
the connection subdomain. A refined analysis of the 
emergence and development of these mutations in 
sequential samples by SGS revealed a chronological 
increase in frequency that paralleled the sequential 
acquisition of Q151M MDR mutations. In addition, the 



Table 4 NVP susceptibilities associated with RT domains 
of patient P66. 



Virus 


ic 50 a 


FC b 


Wild -type 


86.47 ± 1 1.84 




4-RT 


>6,000 


>66 


4-Pol 


>6,000 


>66 


4-Pol-Cn 


>6,000 


>66 


4-Cn-Rh 


410.5 ± 55.2 


4.7 


17-RT 


>6,000 


>66 


1 7-Pol 


>6,000 


>66 


17-Pol-Cn 


>6,000 


>66 


17-Cn-Rh 


73.83 ± 8.54 


0.9 


37-RT 


>6,000 


>66 


37-Pol 


>6,000 


>66 


37-Pol-Cn 


>6,000 


>66 


37-Cn-Rh 


88.23 ± 12.95 


1.0 


a 50% inhibitory concentration in nM ± SEM. 




b Fold change in IC 50 compared to 


wild-type virus. 





analysis showed genetic linkage of most of these muta- 
tions to Q151M MDR mutations indicating an associa- 
tion between the two. Although our results are from 
one patient, the identified mutations in the pol domain 
at codon positions 68 and 202 were previously identified 
in patients infected with subtype B HIV-1 viruses 
[12,28] and in an HIV database sequence analysis done 
in this study (Table 2). The database sequence analysis 
also showed that the DNA pol domain mutations at 
codon positions 31, 33, 48, 102, 123, 135, 174, 197 and 
203 were significantly associated with Q151M in subtype 
B and/or C. 

We show that although the connection subdomain 
mutations were acquired in parallel with Q151M MDR 
mutations they were not directly associated with drug 
resistance but played a role in improving the replicative 
fitness of the Q151M-containing viruses. Our findings 
confirm previous reports showing that the Q151M-con- 
taining virus replicates poorly [13,14,38,39]. We clearly 
show that the patient-derived connection subdomain is 
important for improving the Q151M-containing virus' 
replicative fitness and is thus important for the develop- 
ment of the Q151M pathway. It will be interesting to 
elucidate the particular mutations involved and the 
mechanism behind the connection subdomain's effect 
on replicative fitness of the Q151M-containing RT. The 
mutation at connection subdomain codon positions 386 
and 403 were significantly associated with Q151M in 
the subtype B database analysis; however, a similar ana- 
lysis could not be carried out for subtype C due to lack 
of samples sequenced beyond the DNA pol domain. 
Since the connection subdomain is involved in position- 
ing of the template-primer complex at the polymerase 
active site, one possibility could be that the mutations 
improve enzyme-substrate interactions at the active site. 
Of note, the intermediate Q151K or L mutations which 
have been postulated to be involved in the emergence of 
the Q151M mutation were never identified in our SGS 
analysis. It is possible that these mutations do emerge 
but are only present transiently due to their negative 
effect on replication and, as a result, were missed in this 
analysis. This possibility could not be explored further 
in this study as we were unable to amplify any genomes 
at 28 months, the time point prior to the emergence of 
the Q151M mutation. 

It was surprising to observe that the patient-derived 
connection subdomain and RNase H domain were not 
associated with the decreased susceptibility to NRTIs 
exhibited by the Q151M MDR-containing RTs and also 
that the N348I mutation disappeared prior to the acqui- 
sition of Q151M. As described earlier, N348I confers 
drug resistance by decreasing RNase H activity, thus it 
will be interesting to explore if a negative correlation 
exists between reduced RNase H activity and Q151M. 
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Another surprising finding was that full-blown resis- 
tance did not develop until 37 months after initiation of 
therapy, even though the viral load had been relatively 
high at earlier time points. This raises the possibility of 
suboptimal use of the drugs contributing to the emer- 
gence of the Q151M MDR complex. 

Conclusions 

Understanding the evolution and molecular mechanisms 
leading to the emergence of the Q151M MDR complex 
is important especially in light of its relatively frequent 
occurrence in some ARV rollout cohorts. As shown in 
this study and other previous reports [9], the presence 
of the Q151M mutation significantly limits the options 
for second-line therapies as the Q151M-containing virus 
remains only susceptible to one approved NRTI, TDF. 
Our results showed that the Q151M MDR takes a long 
time to develop and keeping patients on failing NRTI 
therapy could be facilitating its emergence. The Q151M 
MDR is also often linked to other NRTI and NNRTI 
mutations which develop earlier and thus further limit- 
ing the options for second-line regimens. In addition, 
the virus acquires compensatory mutations throughout 
RT which make it fitter, resulting in a virus that could 
persist even after switching to second-line therapy. This 
is a major obstacle in the developing world where fixed 
second-line therapies are composed of two alternate 
NRTIs (usually not TDF) and bPI. Thus, these types of 
studies are important in guiding public health 
approaches to the treatment and clinical management of 
HIV-1 infections in resource-poor settings. 

Methods 

Clinical HIV samples and database analysis 

The plasma samples characterized in this study were 
from a patient enrolled in the CHAP2 prospective 
cohort study at the University Teaching Hospital in 
Lusaka, Zambia [19]. Children in this study were 
initiated on first-line cART of 3TC/d4T/NVP (adult 
Triomune30) and, following immunological or clinical 
failure, were switched to a fixed second-line therapy of 
Abacavir (ABC)/ddI/Kaletra. The prevalence of identi- 
fied accessory mutations in clinical samples was ana- 
lyzed using the Stanford University HIV drug resistance 
database (http:/ /hivdb.Stanford.edu). 

SGS assay 

A previously described SGS assay [40] was modified by 
designing new antisense primers in integrase (IN) and 
used to sequence the full-length protease (PR) and RT 
genes from sequential samples. Briefly, viral RNA was 
extracted from 200 uL of plasma using QIAmp Ultra- 
Sens Virus Kit (Qiagen) following manufacturer's 
instructions and eluted in 60 uL of elution buffer. 



cDNA synthesis and single genome PCR reactions were 
carried out as described previously [40] using primers 
1849+ (5'-GATGACAGCATGTCAGGGAG-3') and 
4368- (5'-GCTAGCTACTATTTCTTTTGCTACT-3'), 
followed by a nested PCR with primers 1870+ (5'- 
GAGTTTTGGCTGAGGCAATGAG-3') and 4295- (5'- 
CTTTCATGCTCTTCTTGAGCCT-3'). Positive PCR 
products were identified by agarose gel electrophoresis 
and purified using illustra GFX PCR DNA and Gel Band 
Purification Kit (GE Healthcare), and sequenced by the 
dideoxy ABI sequencing systems in both directions 
using overlapping internal primers. Sequences were ana- 
lyzed using Sequencher software (Gene Codes) and 
aligned by using subtype-specific consensus sequences. 
Any sequences containing double peaks in the chroma- 
tographs were excluded. Drug resistance mutations were 
defined by using the Stanford University HIV drug resis- 
tance database. 

Phylogenetic analyses 

Full-length PR-RT nucleotide single-genome sequences 
from patient P66 and subtype-specific reference 
sequence MJ4 (subtype C) were aligned using Clustal W 
in MEGA4 software [41]. The aligned sequences were 
imported into PhyML tree building software and ML 
trees were constructed using the GTR model and the 
robustness of the trees was evaluated by bootstrap ana- 
lysis with 500 rounds of replication. 

Single-replication cycle drug susceptibility assay 

A recently described three plasmid-based retroviral vec- 
tor system using a luciferase reporter gene was used to 
study phenotypic drug susceptibility [42,43]. Briefly, vec- 
tor p8MJ4 was modified to accommodate RT domain- 
swapping by introducing three restriction enzyme sites, 
Hpal (flanking RT amino acids 288/289), Spel (flanking 
RT amino acids 423/424) and Clal (flanking IN amino 
acids 4/5) creating p8MJ4-HSC. The MJ4 sequence also 
contains a natural and unique Apal site in p6 region of 
gag. In addition, the Spel site in gag and two Clal sites 
(upstream of gag initiation codon and in gag) were 
eliminated to ensure that the introduced sites were 
unique. In parallel, patient-derived PR-RT single gen- 
omes that closely represented the sequence of the 
majority of the single genomes at each time point were 
subcloned into a TOPO-TA vector (Invitrogen) by PCR 
using primers GagApaF (5'-GCAGGGCCCCTAG- 
GAAAAAGGGC-3') and CRhlNClalRl (5'-CCTTATC- 
GATTCCATCTAGAAATAGC-3'). Similarly, Hpal 
(flanking RT amino acids 288/289) and Spel (flanking 
RT amino acids 423/424) sites were introduced and any 
Hpal or Spel sites that were present in the cloned 
patient fragments were removed using sequence-specific 
primers. Mutagenesis reactions were carried out by site- 
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directed mutagenesis using QuikChange Lightning Multi 
Site-Directed Mutagenesis Kit (Agilent Technologies) 
and the presence and absence of each mutation was ver- 
ified by sequencing. The other two vectors used in the 
system are pMDG encoding the vesicular stomatitis 
virus G protein and retroviral expression vector 
pCSFLW which encodes for the luciferase reporter gene. 
Virus stocks were prepared by cotransfection of 
HEK293T cells as described previously [44-46], diluted 
50- to 500-fold and used to infect HEK293T target cells. 
The virus and target cells were incubated with medium 
containing varying drug concentrations for 48 h. Infec- 
tivity was determined by measuring luciferase activity in 
the target cells using Steady-Glo reporter assay system 
(Promega). Data were expressed relative to that of no 
drug controls and the drug concentrations required to 
inhibit virus replication by 50% (IC 50 ) were determined 
by linear regression analysis. Results are expressed as 
fold changes in the IC50 compared to wild-type subtype 
C virus. 

Antiretroviral drugs 

The NRTIs ABC, AZT, ddl, emtricitabine (FTC), 3TC 
and d4T; and the NNRTIs efavirenz (EFV), etravirine 
(ETV), and NVP were obtained from the NIH AIDS 
Research and Reference Reagent Program. TDF was a 
generous gift from Gilead Sciences (Foster City, CA, 
USA). 

Replicative capacity Assay 

Recombinant viruses expressing wild-type and patient- 
derived RT domains were normalized for p24 capsid 
(Genetic Systems HIV-1 Ag EIA; Bio-Rad) and used to 
infect target HEK293T cells in a single-cycle-replication 
assay. Replicative capacity was determined by measuring 
luciferase activity as described above. 

Statistical analyses 

Student's t test was used to describe differences in IC 50 
values and replicative capacity and two proportions ana- 
lysis was performed by using Fisher's Exact test with P 
values < 0.05 regarded as significant for both tests (Sta- 
taSE software). 

Nucleotide sequence accession numbers 

The single-genome sequences generated and used in this 
study have been submitted to GenBank and assigned the 
accession numbers HQ111194-HQ1 11338. 
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